Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 442 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 34.7 KiB |
| Average record size in memory | 80.3 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 1 |
s1 is highly correlated with s2 and 2 other fields | High correlation |
s2 is highly correlated with s1 and 1 other fields | High correlation |
s3 is highly correlated with s4 | High correlation |
s4 is highly correlated with s1 and 3 other fields | High correlation |
s5 is highly correlated with s1 and 1 other fields | High correlation |
s1 is highly correlated with s2 and 2 other fields | High correlation |
s2 is highly correlated with s1 and 1 other fields | High correlation |
s3 is highly correlated with s4 | High correlation |
s4 is highly correlated with s1 and 3 other fields | High correlation |
s5 is highly correlated with s1 and 1 other fields | High correlation |
s1 is highly correlated with s2 | High correlation |
s2 is highly correlated with s1 and 1 other fields | High correlation |
s3 is highly correlated with s4 | High correlation |
s4 is highly correlated with s2 and 1 other fields | High correlation |
s1 is highly correlated with s2 and 2 other fields | High correlation |
s2 is highly correlated with s1 and 1 other fields | High correlation |
s3 is highly correlated with s4 | High correlation |
s4 is highly correlated with s1 and 3 other fields | High correlation |
s5 is highly correlated with s1 and 1 other fields | High correlation |
Reproduction
| Analysis started | 2022-04-28 14:56:39.423862 |
|---|---|
| Analysis finished | 2022-04-28 14:57:01.210052 |
| Duration | 21.79 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
age
Real number (ℝ)
| Distinct | 58 |
|---|---|
| Distinct (%) | 13.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.638994586 × 10-16 |
| Minimum | -0.1072256316 |
|---|---|
| Maximum | 0.1107266755 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 202 |
| Negative (%) | 45.7% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1072256316 |
|---|---|
| 5-th percentile | -0.0854304009 |
| Q1 | -0.03729926643 |
| median | 0.005383060374 |
| Q3 | 0.03807590643 |
| 95-th percentile | 0.07076875249 |
| Maximum | 0.1107266755 |
| Range | 0.2179523071 |
| Interquartile range (IQR) | 0.07537517286 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -1.30857704 × 1014 |
| Kurtosis | -0.6712236886 |
| Mean | -3.638994586 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03632538451 |
| Skewness | -0.231381533 |
| Sum | -1.609823386 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.01628067573 | 19 | 4.3% |
| 0.04170844488 | 17 | 3.8% |
| 0.009015598825 | 16 | 3.6% |
| -0.02730978568 | 15 | 3.4% |
| -0.001882016528 | 14 | 3.2% |
| -0.05273755484 | 14 | 3.2% |
| 0.04534098334 | 14 | 3.2% |
| 0.01264813728 | 14 | 3.2% |
| 0.06713621404 | 13 | 2.9% |
| 0.005383060374 | 13 | 2.9% |
| Other values (48) | 293 |
| Value | Count | Frequency (%) |
| -0.1072256316 | 3 | 0.7% |
| -0.1035930932 | 3 | 0.7% |
| -0.09996055471 | 2 | 0.5% |
| -0.09632801625 | 4 | |
| -0.0926954778 | 4 | |
| -0.08906293935 | 3 | 0.7% |
| -0.0854304009 | 5 | |
| -0.08179786245 | 2 | 0.5% |
| -0.078165324 | 4 | |
| -0.07453278555 | 8 |
| Value | Count | Frequency (%) |
| 0.1107266755 | 2 | 0.5% |
| 0.09619652165 | 2 | 0.5% |
| 0.0925639832 | 1 | 0.2% |
| 0.08893144475 | 1 | 0.2% |
| 0.0852989063 | 1 | 0.2% |
| 0.08166636785 | 5 | 1.1% |
| 0.07803382939 | 1 | 0.2% |
| 0.07440129094 | 6 | |
| 0.07076875249 | 7 | |
| 0.06713621404 | 13 |
sex
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 32.5 KiB |
| -0.044641636506989 | |
|---|---|
| 0.0506801187398187 |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 18 |
| Min length | 18 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0506801187398187 |
|---|---|
| 2nd row | -0.044641636506989 |
| 3rd row | 0.0506801187398187 |
| 4th row | -0.044641636506989 |
| 5th row | -0.044641636506989 |
Common Values
| Value | Count | Frequency (%) |
| -0.044641636506989 | 235 | |
| 0.0506801187398187 | 207 |
Length
Histogram of lengths of the category
Pie chart
| Value | Count | Frequency (%) |
| 0.044641636506989 | 235 | |
| 0.0506801187398187 | 207 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
bmi
Real number (ℝ)
| Distinct | 163 |
|---|---|
| Distinct (%) | 36.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -8.022742852 × 10-16 |
| Minimum | -0.0902752959 |
|---|---|
| Maximum | 0.170555226 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 247 |
| Negative (%) | 55.9% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.0902752959 |
|---|---|
| 5-th percentile | -0.06656343027 |
| Q1 | -0.03422906806 |
| median | -0.00728376621 |
| Q3 | 0.03124801543 |
| 95-th percentile | 0.08540807214 |
| Maximum | 0.170555226 |
| Range | 0.2608305219 |
| Interquartile range (IQR) | 0.06547708349 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -5.935507157 × 1013 |
| Kurtosis | 0.09509447428 |
| Mean | -8.022742852 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03125655014 |
| Skewness | 0.5981484879 |
| Sum | -3.547162564 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.02452875939 | 8 | 1.8% |
| -0.03099563184 | 8 | 1.8% |
| -0.008361578284 | 7 | 1.6% |
| -0.04608500087 | 7 | 1.6% |
| -0.02560657147 | 7 | 1.6% |
| 0.001338730381 | 6 | 1.4% |
| 0.004572166603 | 6 | 1.4% |
| 0.01427247527 | 6 | 1.4% |
| -0.0202175111 | 6 | 1.4% |
| -0.02345094732 | 6 | 1.4% |
| Other values (153) | 375 |
| Value | Count | Frequency (%) |
| -0.0902752959 | 1 | |
| -0.08919748382 | 1 | |
| -0.08488623553 | 1 | |
| -0.08380842346 | 1 | |
| -0.08165279931 | 2 | |
| -0.08057498723 | 1 | |
| -0.07949717516 | 1 | |
| -0.07734155101 | 2 | |
| -0.07626373894 | 1 | |
| -0.07518592686 | 1 |
| Value | Count | Frequency (%) |
| 0.170555226 | 1 | |
| 0.1608549173 | 1 | |
| 0.1371430517 | 1 | |
| 0.1285205551 | 1 | |
| 0.127442743 | 1 | |
| 0.1252871189 | 1 | |
| 0.1231314947 | 1 | |
| 0.1145089981 | 1 | |
| 0.1112755619 | 1 | |
| 0.1101977498 | 1 |
bp
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 22.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.279849153 × 10-16 |
| Minimum | -0.1123996021 |
|---|---|
| Maximum | 0.1320442172 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 244 |
| Negative (%) | 55.2% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1123996021 |
|---|---|
| 5-th percentile | -0.07435588089 |
| Q1 | -0.0366564468 |
| median | -0.005670610555 |
| Q3 | 0.03564383777 |
| 95-th percentile | 0.08367188395 |
| Maximum | 0.1320442172 |
| Range | 0.2444438193 |
| Interquartile range (IQR) | 0.07230028457 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | 3.720676575 × 1014 |
| Kurtosis | -0.5327797228 |
| Mean | 1.279849153 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03442870694 |
| Skewness | 0.2906638512 |
| Sum | 5.684341886 × 10-14 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.04009931749 | 21 | 4.8% |
| -0.005670610555 | 21 | 4.8% |
| -0.02632783472 | 20 | 4.5% |
| 0.02187235499 | 15 | 3.4% |
| -0.0332135761 | 14 | 3.2% |
| -0.02288496402 | 13 | 2.9% |
| -0.01599922264 | 11 | 2.5% |
| 0.00810087222 | 11 | 2.5% |
| -0.01255635194 | 11 | 2.5% |
| 0.04941532054 | 11 | 2.5% |
| Other values (90) | 294 |
| Value | Count | Frequency (%) |
| -0.1123996021 | 1 | 0.2% |
| -0.1089567314 | 1 | 0.2% |
| -0.10207099 | 1 | 0.2% |
| -0.1009233664 | 1 | 0.2% |
| -0.09862811929 | 1 | 0.2% |
| -0.08485663651 | 4 | |
| -0.08141376582 | 4 | |
| -0.07797089512 | 1 | 0.2% |
| -0.07452802443 | 9 | |
| -0.07108515374 | 1 | 0.2% |
| Value | Count | Frequency (%) |
| 0.1320442172 | 1 | 0.2% |
| 0.1251584758 | 1 | 0.2% |
| 0.1079441223 | 3 | |
| 0.1045012516 | 2 | 0.5% |
| 0.101058381 | 1 | 0.2% |
| 0.09876313371 | 1 | 0.2% |
| 0.09761551026 | 5 | |
| 0.09417263956 | 1 | 0.2% |
| 0.09072976887 | 2 | 0.5% |
| 0.08728689818 | 4 |
| Distinct | 141 |
|---|---|
| Distinct (%) | 31.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -9.042540472 × 10-17 |
| Minimum | -0.1267806699 |
|---|---|
| Maximum | 0.1539137132 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 240 |
| Negative (%) | 54.3% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1267806699 |
|---|---|
| 5-th percentile | -0.07311850845 |
| Q1 | -0.0342478402 |
| median | -0.004320865537 |
| Q3 | 0.02835801485 |
| 95-th percentile | 0.08367131975 |
| Maximum | 0.1539137132 |
| Range | 0.2806943831 |
| Interquartile range (IQR) | 0.06260585505 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -5.26611385 × 1014 |
| Kurtosis | 0.2329479047 |
| Mean | -9.042540472 × 10-17 |
| Median Absolute Deviation (MAD) | 0.03095893931 |
| Skewness | 0.3781082069 |
| Sum | -3.963496198 × 10-14 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.007072771253 | 10 | 2.3% |
| -0.03734373413 | 10 | 2.3% |
| 0.01219056876 | 9 | 2.0% |
| 0.02044628591 | 9 | 2.0% |
| 0.001182945896 | 8 | 1.8% |
| 0.02457414449 | 8 | 1.8% |
| -0.02496015841 | 8 | 1.8% |
| -0.004320865537 | 8 | 1.8% |
| -0.002944912678 | 8 | 1.8% |
| -0.009824676969 | 7 | 1.6% |
| Other values (131) | 357 |
| Value | Count | Frequency (%) |
| -0.1267806699 | 1 | |
| -0.1088932828 | 1 | |
| -0.1047654242 | 1 | |
| -0.1033894713 | 1 | |
| -0.1006375656 | 1 | |
| -0.09650970704 | 2 | |
| -0.0910058956 | 1 | |
| -0.08962994275 | 2 | |
| -0.08825398989 | 1 | |
| -0.08687803703 | 1 |
| Value | Count | Frequency (%) |
| 0.1539137132 | 1 | |
| 0.1525377603 | 1 | |
| 0.1332744203 | 1 | |
| 0.1277706089 | 2 | |
| 0.126394656 | 1 | |
| 0.1250187031 | 2 | |
| 0.1195148917 | 1 | |
| 0.1098832217 | 2 | |
| 0.1030034574 | 1 | |
| 0.09887559883 | 1 |
| Distinct | 302 |
|---|---|
| Distinct (%) | 68.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.303004964 × 10-16 |
| Minimum | -0.115613066 |
|---|---|
| Maximum | 0.1987879897 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 239 |
| Negative (%) | 54.1% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.115613066 |
|---|---|
| 5-th percentile | -0.07271172671 |
| Q1 | -0.03035839726 |
| median | -0.003819065121 |
| Q3 | 0.02984439452 |
| 95-th percentile | 0.07946276829 |
| Maximum | 0.1987879897 |
| Range | 0.3144010556 |
| Interquartile range (IQR) | 0.06020279178 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | 3.654556118 × 1014 |
| Kurtosis | 0.6013811504 |
| Mean | 1.303004964 × 10-16 |
| Median Absolute Deviation (MAD) | 0.0299056781 |
| Skewness | 0.4365918037 |
| Sum | 5.756506383 × 10-14 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.001000728964 | 5 | 1.1% |
| 0.01622243643 | 5 | 1.1% |
| 0.056618588 | 4 | 0.9% |
| -0.02480001206 | 4 | 0.9% |
| -0.04703355285 | 4 | 0.9% |
| -0.0138398159 | 4 | 0.9% |
| -0.05454911593 | 3 | 0.7% |
| -0.02166852744 | 3 | 0.7% |
| 0.004635943348 | 3 | 0.7% |
| 0.03751653184 | 3 | 0.7% |
| Other values (292) | 404 |
| Value | Count | Frequency (%) |
| -0.115613066 | 1 | |
| -0.1127947298 | 1 | |
| -0.106844909 | 1 | |
| -0.1043397214 | 1 | |
| -0.1008950883 | 1 | |
| -0.09713730673 | 1 | |
| -0.09619786135 | 1 | |
| -0.09588471289 | 1 | |
| -0.09463211904 | 1 | |
| -0.09056118904 | 1 |
| Value | Count | Frequency (%) |
| 0.1987879897 | 1 | |
| 0.1558866504 | 1 | |
| 0.1314610704 | 1 | |
| 0.1302084765 | 1 | |
| 0.1280164373 | 1 | |
| 0.1273901404 | 1 | |
| 0.1251981011 | 1 | |
| 0.1170562411 | 1 | |
| 0.1164299442 | 1 | |
| 0.1089143811 | 1 |
| Distinct | 63 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -4.558319534 × 10-16 |
| Minimum | -0.1023070505 |
|---|---|
| Maximum | 0.1811790604 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 243 |
| Negative (%) | 55.0% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1023070505 |
|---|---|
| 5-th percentile | -0.06549067248 |
| Q1 | -0.03511716059 |
| median | -0.006584467611 |
| Q3 | 0.02931150098 |
| 95-th percentile | 0.07790911999 |
| Maximum | 0.1811790604 |
| Range | 0.2834861109 |
| Interquartile range (IQR) | 0.06442866157 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -1.044662342 × 1014 |
| Kurtosis | 0.9815074614 |
| Mean | -4.558319534 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03129392133 |
| Skewness | 0.7992551183 |
| Sum | -2.01505479 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.01394774322 | 22 | 5.0% |
| -0.04340084565 | 19 | 4.3% |
| -0.03971920785 | 18 | 4.1% |
| -0.002902829807 | 15 | 3.4% |
| -0.03235593224 | 15 | 3.4% |
| -0.02131101883 | 15 | 3.4% |
| 0.008142083605 | 15 | 3.4% |
| -0.02867429444 | 15 | 3.4% |
| -0.006584467611 | 14 | 3.2% |
| 0.01550535921 | 14 | 3.2% |
| Other values (53) | 280 |
| Value | Count | Frequency (%) |
| -0.1023070505 | 1 | 0.2% |
| -0.09862541271 | 1 | 0.2% |
| -0.09126213711 | 1 | 0.2% |
| -0.08021722369 | 2 | 0.5% |
| -0.07653558589 | 5 | |
| -0.07285394808 | 5 | |
| -0.06917231028 | 7 | |
| -0.06549067248 | 6 | |
| -0.06180903467 | 7 | |
| -0.05812739687 | 8 |
| Value | Count | Frequency (%) |
| 0.1811790604 | 1 | 0.2% |
| 0.1774974226 | 1 | 0.2% |
| 0.1738157848 | 1 | 0.2% |
| 0.1590892336 | 1 | 0.2% |
| 0.151725958 | 1 | 0.2% |
| 0.1406810446 | 1 | 0.2% |
| 0.1333177689 | 1 | 0.2% |
| 0.1222728555 | 2 | |
| 0.1185912177 | 3 | |
| 0.1038646665 | 1 | 0.2% |
| Distinct | 66 |
|---|---|
| Distinct (%) | 14.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.862389292 × 10-16 |
| Minimum | -0.07639450375 |
|---|---|
| Maximum | 0.1852344433 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 288 |
| Negative (%) | 65.2% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.07639450375 |
|---|---|
| 5-th percentile | -0.07639450375 |
| Q1 | -0.03949338287 |
| median | -0.002592261998 |
| Q3 | 0.03430885888 |
| 95-th percentile | 0.08076737006 |
| Maximum | 0.1852344433 |
| Range | 0.261628947 |
| Interquartile range (IQR) | 0.07380224175 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | 1.232890939 × 1014 |
| Kurtosis | 0.4444016718 |
| Mean | 3.862389292 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03690112088 |
| Skewness | 0.7353736479 |
| Sum | 1.696420782 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.03949338287 | 128 | |
| -0.002592261998 | 108 | |
| 0.03430885888 | 68 | |
| 0.07120997975 | 33 | 7.5% |
| -0.07639450375 | 28 | 6.3% |
| 0.1081111006 | 13 | 2.9% |
| 0.1450122215 | 2 | 0.5% |
| -0.03764832683 | 2 | 0.5% |
| 0.01585829844 | 2 | 0.5% |
| -0.02141183364 | 2 | 0.5% |
| Other values (56) | 56 |
| Value | Count | Frequency (%) |
| -0.07639450375 | 28 | 6.3% |
| -0.07085933562 | 1 | 0.2% |
| -0.06938329078 | 1 | 0.2% |
| -0.05351580881 | 1 | 0.2% |
| -0.05167075276 | 1 | 0.2% |
| -0.05056371914 | 1 | 0.2% |
| -0.05019470793 | 1 | 0.2% |
| -0.04798064068 | 1 | 0.2% |
| -0.04724261826 | 1 | 0.2% |
| -0.03949338287 | 128 |
| Value | Count | Frequency (%) |
| 0.1852344433 | 1 | 0.2% |
| 0.1553445354 | 1 | 0.2% |
| 0.1450122215 | 2 | 0.5% |
| 0.1413221094 | 1 | 0.2% |
| 0.1302517732 | 1 | 0.2% |
| 0.1081111006 | 13 | |
| 0.09187460744 | 1 | 0.2% |
| 0.08670845052 | 1 | 0.2% |
| 0.08486339448 | 1 | 0.2% |
| 0.08080427118 | 1 | 0.2% |
| Distinct | 184 |
|---|---|
| Distinct (%) | 41.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.8280088 × 10-16 |
| Minimum | -0.1260973856 |
|---|---|
| Maximum | 0.13359898 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 230 |
| Negative (%) | 52.0% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1260973856 |
|---|---|
| 5-th percentile | -0.0721284546 |
| Q1 | -0.03324878725 |
| median | -0.001947634157 |
| Q3 | 0.03243322578 |
| 95-th percentile | 0.07904666678 |
| Maximum | 0.13359898 |
| Range | 0.2596963656 |
| Interquartile range (IQR) | 0.06568201303 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -1.243963902 × 1014 |
| Kurtosis | -0.1343658334 |
| Mean | -3.8280088 × 10-16 |
| Median Absolute Deviation (MAD) | 0.03314062486 |
| Skewness | 0.2917738324 |
| Sum | -1.694477891 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| -0.01811826731 | 11 | 2.5% |
| -0.03075120986 | 10 | 2.3% |
| -0.04118038519 | 8 | 1.8% |
| -0.05140053526 | 7 | 1.6% |
| -0.02595242444 | 7 | 1.6% |
| -0.03324878725 | 7 | 1.6% |
| -0.01090443585 | 6 | 1.4% |
| -0.0006092541861 | 6 | 1.4% |
| -0.06117659509 | 6 | 1.4% |
| -0.02364455757 | 6 | 1.4% |
| Other values (174) | 368 |
| Value | Count | Frequency (%) |
| -0.1260973856 | 1 | 0.2% |
| -0.1043648208 | 1 | 0.2% |
| -0.1016435479 | 1 | 0.2% |
| -0.09643322289 | 4 | |
| -0.09393564551 | 1 | 0.2% |
| -0.08913686008 | 1 | 0.2% |
| -0.08682899322 | 2 | |
| -0.08238148326 | 2 | |
| -0.08023654025 | 1 | 0.2% |
| -0.07814091067 | 2 |
| Value | Count | Frequency (%) |
| 0.13359898 | 2 | |
| 0.1333957338 | 1 | |
| 0.1323726493 | 1 | |
| 0.1300806095 | 1 | |
| 0.1290194116 | 1 | |
| 0.120053382 | 1 | |
| 0.1193439942 | 1 | |
| 0.1063542767 | 1 | |
| 0.1041376114 | 1 | |
| 0.1032922649 | 1 |
s6
Real number (ℝ)
| Distinct | 56 |
|---|---|
| Distinct (%) | 12.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -3.400999944 × 10-16 |
| Minimum | -0.1377672257 |
|---|---|
| Maximum | 0.1356118307 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 224 |
| Negative (%) | 50.7% |
| Memory size | 3.6 KiB |
Quantile statistics
| Minimum | -0.1377672257 |
|---|---|
| 5-th percentile | -0.07563562197 |
| Q1 | -0.03317902609 |
| median | -0.0010776975 |
| Q3 | 0.0279170509 |
| 95-th percentile | 0.0817644408 |
| Maximum | 0.1356118307 |
| Range | 0.2733790564 |
| Interquartile range (IQR) | 0.06109607699 |
Descriptive statistics
| Standard deviation | 0.04761904762 |
|---|---|
| Coefficient of variation (CV) | -1.400148439 × 1014 |
| Kurtosis | 0.2369167379 |
| Mean | -3.400999944 × 10-16 |
| Median Absolute Deviation (MAD) | 0.0289947484 |
| Skewness | 0.2079166162 |
| Sum | -1.501021529 × 10-13 |
| Variance | 0.002267573696 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) |
| 0.003064409414 | 22 | 5.0% |
| 0.01963283707 | 20 | 4.5% |
| 0.007206516329 | 20 | 4.5% |
| -0.0010776975 | 19 | 4.3% |
| -0.01350401824 | 16 | 3.6% |
| -0.01764612516 | 16 | 3.6% |
| -0.03835665973 | 15 | 3.4% |
| -0.05492508739 | 14 | 3.2% |
| -0.005219804415 | 14 | 3.2% |
| 0.01549073016 | 14 | 3.2% |
| Other values (46) | 272 |
| Value | Count | Frequency (%) |
| -0.1377672257 | 1 | 0.2% |
| -0.1294830119 | 2 | 0.5% |
| -0.1046303704 | 2 | 0.5% |
| -0.09634615654 | 2 | 0.5% |
| -0.09220404963 | 4 | |
| -0.08806194271 | 2 | 0.5% |
| -0.0839198358 | 3 | |
| -0.07977772888 | 4 | |
| -0.07563562197 | 4 | |
| -0.07149351505 | 5 |
| Value | Count | Frequency (%) |
| 0.1356118307 | 3 | |
| 0.1314697238 | 2 | |
| 0.1273276169 | 1 | 0.2% |
| 0.119043403 | 2 | |
| 0.1066170823 | 4 | |
| 0.09833286846 | 2 | |
| 0.09419076154 | 1 | 0.2% |
| 0.09004865463 | 2 | |
| 0.08590654771 | 4 | |
| 0.0817644408 | 4 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here. A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
First rows
| age | sex | bmi | bp | s1 | s2 | s3 | s4 | s5 | s6 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0.038076 | 0.050680 | 0.061696 | 0.021872 | -0.044223 | -0.034821 | -0.043401 | -0.002592 | 0.019908 | -0.017646 |
| 1 | -0.001882 | -0.044642 | -0.051474 | -0.026328 | -0.008449 | -0.019163 | 0.074412 | -0.039493 | -0.068330 | -0.092204 |
| 2 | 0.085299 | 0.050680 | 0.044451 | -0.005671 | -0.045599 | -0.034194 | -0.032356 | -0.002592 | 0.002864 | -0.025930 |
| 3 | -0.089063 | -0.044642 | -0.011595 | -0.036656 | 0.012191 | 0.024991 | -0.036038 | 0.034309 | 0.022692 | -0.009362 |
| 4 | 0.005383 | -0.044642 | -0.036385 | 0.021872 | 0.003935 | 0.015596 | 0.008142 | -0.002592 | -0.031991 | -0.046641 |
| 5 | -0.092695 | -0.044642 | -0.040696 | -0.019442 | -0.068991 | -0.079288 | 0.041277 | -0.076395 | -0.041180 | -0.096346 |
| 6 | -0.045472 | 0.050680 | -0.047163 | -0.015999 | -0.040096 | -0.024800 | 0.000779 | -0.039493 | -0.062913 | -0.038357 |
| 7 | 0.063504 | 0.050680 | -0.001895 | 0.066630 | 0.090620 | 0.108914 | 0.022869 | 0.017703 | -0.035817 | 0.003064 |
| 8 | 0.041708 | 0.050680 | 0.061696 | -0.040099 | -0.013953 | 0.006202 | -0.028674 | -0.002592 | -0.014956 | 0.011349 |
| 9 | -0.070900 | -0.044642 | 0.039062 | -0.033214 | -0.012577 | -0.034508 | -0.024993 | -0.002592 | 0.067736 | -0.013504 |
Last rows
| age | sex | bmi | bp | s1 | s2 | s3 | s4 | s5 | s6 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 432 | 0.009016 | -0.044642 | 0.055229 | -0.005671 | 0.057597 | 0.044719 | -0.002903 | 0.023239 | 0.055684 | 0.106617 |
| 433 | -0.027310 | -0.044642 | -0.060097 | -0.029771 | 0.046589 | 0.019980 | 0.122273 | -0.039493 | -0.051401 | -0.009362 |
| 434 | 0.016281 | -0.044642 | 0.001339 | 0.008101 | 0.005311 | 0.010899 | 0.030232 | -0.039493 | -0.045421 | 0.032059 |
| 435 | -0.012780 | -0.044642 | -0.023451 | -0.040099 | -0.016704 | 0.004636 | -0.017629 | -0.002592 | -0.038459 | -0.038357 |
| 436 | -0.056370 | -0.044642 | -0.074108 | -0.050428 | -0.024960 | -0.047034 | 0.092820 | -0.076395 | -0.061177 | -0.046641 |
| 437 | 0.041708 | 0.050680 | 0.019662 | 0.059744 | -0.005697 | -0.002566 | -0.028674 | -0.002592 | 0.031193 | 0.007207 |
| 438 | -0.005515 | 0.050680 | -0.015906 | -0.067642 | 0.049341 | 0.079165 | -0.028674 | 0.034309 | -0.018118 | 0.044485 |
| 439 | 0.041708 | 0.050680 | -0.015906 | 0.017282 | -0.037344 | -0.013840 | -0.024993 | -0.011080 | -0.046879 | 0.015491 |
| 440 | -0.045472 | -0.044642 | 0.039062 | 0.001215 | 0.016318 | 0.015283 | -0.028674 | 0.026560 | 0.044528 | -0.025930 |
| 441 | -0.045472 | -0.044642 | -0.073030 | -0.081414 | 0.083740 | 0.027809 | 0.173816 | -0.039493 | -0.004220 | 0.003064 |